Entropy and Dynamism Criteria for Voice Quality Classification Applications
نویسندگان
چکیده
We describe the voice quality classification system that uses entropy and dynamism criteria as discrimination features. The main idea of this approach is that the input neural net is considered as an informational channel. Channel tuned to the certain type of information transmits it best of all according to the informational criterion. In our case a multilayer perceptron (MLP) emitted posterior probabilities for speech recognition was used as such information channel. Then two features entropy and dynamism were computed using these posterior probabilities. And finally HMM was used as a classifier. Different experiments demonstrated efficient usage possibilities of entropy and dynamism criteria not only in audio classification tasks but also in the voice quality classification applications.
منابع مشابه
Evaluation of Google’s Voice Recognition and Sentence Classification for Health Care Applications
This study examined the use of voice recognition technology in perioperative services (Periop) to enable Periop staff to record workflow milestones using mobile technology. The use of mobile technology to improve patient flow and quality of care could be facilitated if such voice recognition technology could be made robust. The goal of this experiment was to allow the Periop staff to provide ca...
متن کاملAudio Documents Analysis And Indexing: Entropy and Dynamism Criteria
This work relates to the solution of real multimedia documents indexing tasks. New approach to high-performance speech/music segmentation system creation is considered. This approach uses wavelet analysis and it is based not only on new distinctive features (entropy and dynamism) investigation, but it also takes into account information concerning language. Different experiments have shown the ...
متن کاملSpeech/music segmentation using entropy and dynamism features in a HMM classification framework
In this paper, we present a new approach towards high performance speech/music discrimination on realistic tasks related to the automatic transcription of broadcast news. In the approach presented here, an artificial neural network (ANN) trained on clean speech only (as used in a standard large vocabulary speech recognition system) is used as a channel model at the output of which the entropy a...
متن کاملFault diagnosis of gearboxes using LSSVM and WPT
This paper concentrates on a new procedure which experimentally recognises gears and bearings faults of a typical gearbox system using a least square support vector machine (LSSVM). Two wavelet selection criteria Maximum Energy to Shannon Entropy ratio and Maximum Relative Wavelet Energy are used and compared to select an appropriate wavelet for feature extraction. The fault diagnosis method co...
متن کاملStudying Influence of Preheating Conditions on Design Parameters of Continuous Paint Cure Ovens
This paper concentrates on a new procedure which experimentally recognises gears and bearings faults of a typical gearbox system using a least square support vector machine (LSSVM). Two wavelet selection criteria Maximum Energy to Shannon Entropy ratio and Maximum Relative Wavelet Energy are used and compared to select an appropriate wavelet for feature extraction. The fault diagnosis method co...
متن کامل